AMENDMENTS TO THE CLAIMS 

Claims Pending: 

• At time of the Action: Claims 1-45 

• Amended Claims Currently: Claims 27-29 and 31-35 

• Amended Claims Previously: Claims 1, 13-17, 19-24, 25, and 36 

• Cancelled Claims Previously: Claims 6, 18, 30, and 40 

• After this Response: Claims 1-5, 7-17, 19-29, 31-39, and 41-45 

The following listing of claims replaces all prior versions and listings of claims in the 
application. 

1. (Currently Amended) A method for verifying relevance between terms and 
Web site contents, the method comprising: 

retrieving site contents from a bid URL; 

formulating expanded term(s) comprising at least one of semantically or contextually 
related to bid term(s), which are mined from a search engine in view of high-frequency of 
occurrence historical query terms; 

generating content similarity and expanded similarity measurements from respective 
combinations of the bid term(s), the site contents, and the expanded terms, wherein the 
similarity measurements indicate relatedness between respective ones of the bid term(s), site 
contents, or expanded terms; 

calculating category similarity measurements between the expanded terms and the 
site contents in view of a similarity classifier, wherein the similarity classifier has been 
trained from mined web site content associated with directory data; 

calculating a confidence value from combined ones of multiple similarity 
measurements, wherein the combined ones comprise content, expanded, and category 
similarity measurements, wherein the confidence value provides an objective measure of 
relevance between the bid term(s) and the site contents; 

analyzing the confidence value to identify the bid term(s); and 

using the bid term(s) identified to increase traffic to a site to obtain site exposure; 

wherein generating the category similarity measurements further comprises: 

extracting features from Web site content associated with the directory data, the 
features comprising at least one of title, metadata, body, hypertext link(s), visual feature(s), 
and summarization by page layout analysis information; 

reducing dimensionality of the features via feature selection; 

categorizing the features via a classifier model to generate the similarity classifier; 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors as a function of the 
similarity classifier to determine the category similarity measurements. 
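
The category similarity steps recited above (feature extraction, feature selection, classifier training, and similarity as a function of the classifier) can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: it uses only body text as features, keeps the k most frequent terms as the feature selection criterion, uses a naive Bayes model with a uniform prior, and compares texts by the cosine of their category posterior vectors. The directory data and all function names are hypothetical.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase word tokens; stands in for richer feature extraction."""
    return re.findall(r"[a-z0-9]+", text.lower())

def select_features(directory, k):
    """Dimensionality reduction (assumed criterion): keep the k most frequent terms."""
    counts = Counter(t for docs in directory.values() for d in docs for t in tokenize(d))
    return {t for t, _ in counts.most_common(k)}

def train_classifier(directory, vocab):
    """Naive Bayes term log-likelihoods per directory category, add-one smoothed."""
    model = {}
    for category, docs in directory.items():
        counts = Counter(t for d in docs for t in tokenize(d) if t in vocab)
        total = sum(counts.values()) + len(vocab)
        model[category] = {t: math.log((counts[t] + 1) / total) for t in vocab}
    return model

def category_vector(model, text):
    """Posterior over categories for a text (uniform prior assumed)."""
    scores = {c: sum(logp[t] for t in tokenize(text) if t in logp)
              for c, logp in model.items()}
    top = max(scores.values())
    weights = {c: math.exp(s - top) for c, s in scores.items()}
    norm = sum(weights.values())
    return {c: w / norm for c, w in weights.items()}

def category_similarity(model, text_a, text_b):
    """Cosine similarity between the two category posterior vectors."""
    pa, pb = category_vector(model, text_a), category_vector(model, text_b)
    dot = sum(pa[c] * pb[c] for c in pa)
    len_a = math.sqrt(sum(p * p for p in pa.values()))
    len_b = math.sqrt(sum(p * p for p in pb.values()))
    return dot / (len_a * len_b)

# Hypothetical directory data: category -> sample page text.
directory = {
    "photography": ["camera lens photo", "digital camera photo"],
    "gardening": ["garden hose lawn", "lawn mower garden"],
}
vocab = select_features(directory, k=10)
model = train_classifier(directory, vocab)
```

With this toy directory, `category_similarity(model, "camera lens", "digital photo")` scores higher than `category_similarity(model, "camera lens", "lawn mower")`, because the first pair falls into the same category even though the two texts share no words.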

2. (Original) A method as recited in claim 1, wherein the similarity classifier is 
based on a statistical n-gram based naive Bayesian (N-Gram), a naive Bayesian (NB), 
support vector machine (SVM), a nearest neighbor (KNN), a decision tree, a co-training, or 
a boosting classification model. 

3. (Original) A method as recited in claim 1, wherein formulating the expanded 
terms further comprises generating term clusters from term vectors based on calculated term 
similarity, the term vectors being generated from historical queries, each historical query 
having a high frequency of occurrence, the term clusters comprising the expanded terms. 
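
The term clustering recited in claim 3 can be sketched as a greedy single-pass grouping of term vectors by cosine similarity. The clustering algorithm, the 0.5 threshold, and the sample vectors are assumptions for illustration; the claim requires only that clusters be generated from term vectors based on calculated term similarity.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0

def cluster_terms(term_vectors, threshold=0.5):
    """A term joins the first cluster whose seed vector is at least
    `threshold` similar; otherwise it starts a new cluster."""
    clusters = []  # list of (seed_vector, member_terms)
    for term, vec in term_vectors.items():
        for seed, members in clusters:
            if cosine(seed, vec) >= threshold:
                members.append(term)
                break
        else:
            clusters.append((vec, [term]))
    return [members for _, members in clusters]

# Hypothetical vectors, e.g. built from search results for frequent historical queries.
term_vectors = {
    "digital camera": Counter(photo=3, lens=2),
    "slr camera": Counter(photo=2, lens=3),
    "garden hose": Counter(water=4, lawn=1),
}
clusters = cluster_terms(term_vectors)
```

Each resulting cluster then supplies candidate expanded terms for any bid term that falls into it.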

4. (Original) A method as recited in claim 1, wherein generating the content 
similarity measurements further comprises generating respective term vectors from the bid 
term(s) and the site contents, and calculating similarity between the respective term vectors 
to determine direct similarity between the bid term(s) and the site contents. 
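
The direct similarity of claim 4 can be sketched with term-frequency vectors and cosine similarity. The tokenizer, the raw term-frequency weighting, and the function names are assumptions; the claim does not specify a weighting scheme or a particular similarity function.

```python
import math
import re
from collections import Counter

def term_vector(text):
    """Raw term-frequency vector (assumed weighting) built from text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine_similarity(u, v):
    """Cosine of the angle between two sparse term vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def content_similarity(bid_terms, site_contents):
    """Direct similarity between the bid term(s) and the site contents."""
    return cosine_similarity(term_vector(bid_terms), term_vector(site_contents))
```

A score near 1 indicates that the site text is dominated by the bid terms, while a score of 0 indicates no shared terms at all.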

5. (Original) A method as recited in claim 1, wherein generating the expanded 
similarity measurements further comprises: 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors to determine the expanded 
similarity measurements between the bid term(s) and the site contents. 
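
One way to read the expanded similarity of claim 5 is sketched below: the bid term vector is augmented with the vectors of the expanded terms before it is compared with the site contents. How the three vectors are combined is an assumption here, since the claim states only that similarity is calculated between the respective term vectors.

```python
import math
import re
from collections import Counter

def term_vector(text):
    """Raw term-frequency vector built from text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(u, v):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0

def expanded_similarity(bid_terms, expanded_terms, site_contents):
    """Assumed combination: add the expanded-term vectors to the bid-term vector."""
    query = term_vector(bid_terms)
    for term in expanded_terms:
        query.update(term_vector(term))
    return cosine(query, term_vector(site_contents))

site = "lens reviews and photography tips"
direct = expanded_similarity("camera", [], site)
expanded = expanded_similarity("camera", ["digital photography", "lens"], site)
```

In this example the site never mentions "camera", so the direct score is zero, while the expanded terms recover a non-zero score from the related words "lens" and "photography".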

6. (Cancelled). 

7. (Original) A method as recited in claim 1, wherein calculating the confidence 
value further comprises: 

training a combined relevance classifier with data of the form <term(s), Web site 
content, accept/reject> in view of an accept/reject threshold; 

generating relevance verification similarity measurement (RSVM) feature vectors 
from the content, expanded, and category similarity measurements; and 

mapping multiple scores from the RSVM feature vectors to the confidence value via 
the combined relevance classifier. 
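
The mapping from RSVM feature vectors to a confidence value can be sketched with a small logistic model trained on accept/reject examples. The choice of logistic regression, the learning rate, the epoch count, the 0.5 threshold, and the training rows are all assumptions for illustration; the claim does not name a model for the combined relevance classifier.

```python
import math

ACCEPT_THRESHOLD = 0.5  # assumed accept/reject threshold

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_relevance_classifier(samples, epochs=2000, lr=0.5):
    """samples: list of (feature_vector, label), label 1 = accept, 0 = reject.
    Each feature vector holds [content, expanded, category] similarity scores."""
    n = len(samples[0][0])
    weights, bias = [0.0] * n, 0.0
    for _ in range(epochs):
        for x, y in samples:
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = p - y  # gradient of the log loss with respect to the logit
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias

def confidence(model, x):
    """Map one RSVM feature vector to a confidence value in (0, 1)."""
    weights, bias = model
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

# Hypothetical labeled examples derived from <term(s), Web site content, accept/reject>.
training = [
    ([0.9, 0.8, 0.9], 1),
    ([0.8, 0.7, 0.6], 1),
    ([0.1, 0.2, 0.1], 0),
    ([0.2, 0.1, 0.3], 0),
]
model = train_relevance_classifier(training)
accepted = confidence(model, [0.85, 0.9, 0.8]) >= ACCEPT_THRESHOLD
```

The single confidence value replaces three separate scores, so one threshold can decide whether a bid term is accepted for a site.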

8. (Original) A method as recited in claim 1, wherein the method further 
comprises: 

caching the bid term(s) and bid URL into a bidding database; 

responsive to receipt of a search query, determining if terms of the search query are 
relevant to the bid term(s) in view of a possibility that the terms of the search query may not 
exactly match the bid term(s); and 

if the term(s) of the search query are determined to be relevant to the bid term(s), 
communicating the bid URL to the end-user. 
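
The caching and inexact matching of claim 8 can be sketched with an in-memory bidding database that returns a bid URL when a query is sufficiently close to the cached bid term(s). The Jaccard token overlap, the 0.5 threshold, the class name, and the example URL are assumptions; the claim does not state how relevance between query terms and bid terms is determined.

```python
import re

def tokens(text):
    """Set of lowercase word tokens in the text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

class BiddingDatabase:
    """Hypothetical in-memory store mapping bid term(s) to a bid URL."""

    def __init__(self, threshold=0.5):
        self.threshold = threshold
        self.bids = {}

    def cache(self, bid_terms, bid_url):
        self.bids[bid_terms] = bid_url

    def lookup(self, query):
        """Return bid URLs whose bid term(s) are relevant to the query,
        even when the query does not match the bid term(s) exactly."""
        q = tokens(query)
        results = []
        for bid_terms, url in self.bids.items():
            b = tokens(bid_terms)
            union = q | b
            score = len(q & b) / len(union) if union else 0.0
            if score >= self.threshold:
                results.append(url)
        return results

db = BiddingDatabase()
db.cache("digital camera", "http://example.com/cameras")
hits = db.lookup("camera digital cheap")
```

Here the query differs from the bid terms in word order and adds a word, yet still retrieves the bid URL because two of the three distinct tokens are shared.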

9. (Original) A method as recited in claim 1, wherein the method further 
comprises: 

determining proper name similarity measurements from the bid term(s) and site 
contents, the proper name similarity measurements indicating relatedness between any 
proper name(s) detected in the bid term(s) and the site contents in view of a set of proper 
names; and 

wherein the combined ones of multiple similarity measurements comprise the proper 
name similarity measurements. 

10. (Previously Presented) A method as recited in claim 9, wherein determining 
the proper name similarity measurements further comprises: 

responsive to detecting a proper name comprising at least one of the bid term(s) or 
the site contents, calculating a proper name similarity score as: 

Prop_Sim(term, site contents), 

wherein Prop_Sim(term, site contents) equals: one (1) when a term contains a 
proper name P, and site contents contains a conformable proper name Q; zero (0) when a 
term contains a proper name P, and site contents contains only unconformable proper 
name(s); or, zero-point-five (0.5). 
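
The three-valued Prop_Sim score can be sketched as follows. Proper name detection is assumed to have happened already, and the conformability test is passed in as a predicate; the case-insensitive comparison used in the example is a hypothetical stand-in, since the claim does not define when two proper names are conformable.

```python
def prop_sim(term_names, site_names, is_conformable):
    """Proper name similarity score.

    term_names / site_names: sets of proper names detected in the bid
    term(s) and in the site contents. Returns 1.0 when some name in the
    term conforms to some name in the site, 0.0 when the site contains
    only unconformable names, and 0.5 in every other case.
    """
    if term_names and site_names:
        if any(is_conformable(p, q) for p in term_names for q in site_names):
            return 1.0
        return 0.0
    return 0.5

def same_name(p, q):
    """Assumed conformability test: case-insensitive equality."""
    return p.lower() == q.lower()
```

For example, `prop_sim({"Nikon"}, {"nikon"}, same_name)` yields 1.0, `prop_sim({"Nikon"}, {"Canon"}, same_name)` yields 0.0, and a term with no proper name yields the neutral 0.5.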

11. (Previously Presented) A method as recited in claim 1, wherein the method 
further comprises: 

determining that the confidence value is relatively low; and 

responsive to the determining, identifying one or more other terms comprising at 
least one of semantically or contextually related to the bid URL. 

12. (Previously Presented) A method as recited in claim 11, wherein identifying 
further comprises: 

generating a set of term clusters from term vectors based on calculated term 
similarity, the term vectors being generated from search engine results of submitted 
historical queries, each historical query having a relatively low frequency of occurrence as 
compared to other query terms in a query log; and 

evaluating the site contents in view of term(s) specified by the term clusters to 
identify at least one or more semantically or contextually related terms, the terms being the 
one or more other terms. 
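
The fallback of claims 11 and 12 can be sketched as below: when the confidence value is low, term clusters built from low-frequency historical queries are evaluated against the site contents, and the clusters that touch the site supply the other terms. The 0.3 cutoff, the token-overlap test, and the sample clusters are assumptions; the claims say only that the confidence is "relatively low" and that the site contents are evaluated in view of the clustered terms.

```python
import re

def tokens(text):
    """Set of lowercase word tokens in the text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def suggest_terms(confidence, site_contents, term_clusters, low_threshold=0.3):
    """Return other related terms only when the confidence value is low."""
    if confidence >= low_threshold:
        return []
    site_tokens = tokens(site_contents)
    suggestions = []
    for cluster in term_clusters:
        # Assumed relatedness test: any term in the cluster shares a token with the site.
        if any(tokens(term) & site_tokens for term in cluster):
            suggestions.extend(cluster)
    return suggestions

# Hypothetical clusters built from low-frequency historical queries.
clusters = [["tulip bulbs", "garden tools"], ["slr lens", "camera bag"]]
site = "We sell camera accessories and tripods"
suggested = suggest_terms(0.1, site, clusters)
```

Only the cluster that overlaps the site text is returned, so the sponsor is offered "slr lens" and "camera bag" rather than unrelated gardening terms.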

13. (Currently Amended) A computer-readable storage medium comprising 
computer-executable instructions for verifying relevance between terms and Web site 
contents, the computer-executable instructions comprising instructions for: 

retrieving site contents from a bid URL; 

formulating expanded term(s) comprising at least one of semantically or contextually 
related to bid term(s), which are mined from a search engine in view of high-frequency of 
occurrence historical query terms; 

generating content similarity and expanded similarity measurements from respective 
combinations of the bid term(s), the site contents, and the expanded terms, wherein the 
similarity measurements indicate relatedness between respective ones of the bid term(s), site 
contents, or expanded terms; 

calculating category similarity measurements between the expanded terms and the 
site contents in view of a similarity classifier, wherein the similarity classifier has been 
trained from mined web site content associated with directory data; 

calculating a confidence value from combined ones of multiple similarity 
measurements, wherein the combined ones comprise content, expanded, and category 
similarity measurements; 

providing an objective measure of relevance between the bid term(s) and the site 
contents as indicated by the confidence value; 

analyzing the confidence value to identify the bid term(s); and 

using the bid term(s) identified to increase traffic to a site to obtain site exposure; 

wherein the computer-executable instructions for generating the category similarity 
measurements further comprise instructions for: 

extracting features from Web site content associated with the directory data, the 
features comprising a combination of at least one of title, metadata, body, hypertext link(s), 
visual feature(s), and summarization by page layout analysis information; 

reducing dimensionality of the features via feature selection; 

categorizing the features via a classifier model to generate the similarity classifier; 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors as a function of the 
similarity classifier to determine the category similarity measurements. 



14. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the similarity classifier is based on a statistical n-gram based naive 
Bayesian (N-Gram), a naive Bayesian (NB), support vector machine (SVM), a nearest 
neighbor (KNN), a decision tree, a co-training, or a boosting classification model. 

15. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions for formulating the expanded terms 
further comprise instructions for generating term clusters from term vectors based on 
calculated term similarity, the term vectors being generated from historical queries, each 
historical query having a high frequency of occurrence, the term clusters comprising the 
expanded terms. 



16. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions for generating the content similarity 
measurements further comprise instructions for generating respective term vectors from the 
bid term(s) and the site contents, and calculating similarity between the respective term 
vectors to determine direct similarity between the bid term(s) and the site contents. 

17. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions for generating the expanded 
similarity measurements further comprise instructions for: 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors to determine the expanded 
similarity measurements between the bid term(s) and the site contents. 



18. (Cancelled). 

19. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions for calculating the confidence value 
further comprise instructions for: 

training a combined relevance classifier with data of the form <term(s), Web site 
content, accept/reject> in view of an accept/reject threshold; 

generating relevance verification similarity measurement (RSVM) feature vectors 
from the content, expanded, and category similarity measurements; and 

mapping multiple scores from the RSVM feature vectors to the confidence value via 
the combined relevance classifier. 

20. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions further comprise instructions for: 

caching the bid term(s) and bid URL into a bidding database; 

responsive to receipt of a search query, determining if terms of the search query are 
relevant to the bid term(s) in view of a possibility that the terms of the search query may not 
exactly match the bid term(s); and 

if the term(s) of the search query are determined to be relevant to the bid term(s), 
communicating the bid URL to the end-user. 

21. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions further comprise instructions for: 

determining proper name similarity measurements from the bid term(s) and site 
contents, the proper name similarity measurements indicating relatedness between any 
proper name(s) detected in the bid term(s) and the site contents in view of a set of proper 
names; and 



10 of 22 

Lee & Hayes, PLLC 

RESPONSE TO OFFICE ACTION DATED OCTOBER 12, 2006 

ATTORNEY DOCKET NO. MS1-1891US 
Serial No. I0/*20,|« 

PAGE 10/22 * RCVD AT 5/9/2007 6:37:05 PM [Eastern Daylight Time] * SVR:USPTO-EFXRF-2/13 * DNIS:2734035 * CSID: * DURATION (mm-ss):05-30 

MAY 09 2007 15:45 FR TO 15712734035 P. 11/22 



wherein the combined ones of multiple similarity measurements comprise the proper 
name similarity measurements. 

22. (Currently Amended) A computer-readable storage medium as recited in 
claim 21, wherein the computer-executable instructions for determining the proper name 
similarity measurements further comprise instructions for: 

responsive to detecting a proper name comprising at least one of the bid term(s) or 
the site contents, calculating a proper name similarity score as: 

Prop_Sim(term, site contents) and 

wherein Prop_Sim(term, site contents) equals: one (1) when a term contains a proper 
name P, and site contents contains a conformable proper name Q; zero (0) when a term 
contains a proper name P, and site contents contains only unconformable proper name(s); or, 
zero-point-five (0.5). 

23. (Currently Amended) A computer-readable storage medium as recited in 
claim 13, wherein the computer-executable instructions further comprise instructions for: 

determining that the confidence value is relatively low; and 

responsive to the determining, identifying one or more other terms that are 
semantically and/or contextually related to the bid URL. 

24. (Currently Amended) A computer-readable storage medium as recited in 
claim 23, wherein the computer-executable instructions for identifying further comprise 
instructions for: 

generating a set of term clusters from term vectors based on calculated term 
similarity, the term vectors being generated from search engine results of submitted 
historical queries, each historical query having a relatively low frequency of occurrence as 
compared to other query terms in a query log; and 

evaluating the site contents in view of term(s) specified by the term clusters to 
identify one or more semantically and/or contextually related terms, the terms being the one 
or more other terms. 

25. (Currently Amended) A computing device for verifying relevance between 
terms and Web site contents, the computing device comprising: 
a processor; and 

a memory coupled to the processor, the memory comprising computer-program 
instructions executable by the processor for: 

retrieving site contents from a bid URL; 

formulating expanded term(s) comprising at least one of semantically or contextually 
related to bid term(s); 

generating content similarity and expanded similarity measurements from respective 
combinations of the bid term(s), the site contents, and the expanded terms, wherein the 
similarity measurements indicate relatedness between respective ones of the bid term(s), site 
contents, or expanded terms; 

calculating a confidence value from combined ones of multiple similarity 
measurements, wherein the combined ones comprise content, expanded, and category 
similarity measurements; 

providing an objective measure of relevance between the bid term(s) and the site 
contents as indicated by the confidence value; 

analyzing the confidence value to identify the bid term(s); and 

using the bid term(s) identified to increase traffic to a site to obtain site exposure; 

wherein the computer-executable instructions for generating the category similarity 
measurements further comprise instructions for: 

extracting features from Web site content associated with the directory data, the 
features comprising a combination of at least one of title, metadata, body, hypertext link(s), 
visual feature(s), and summarization by page layout analysis information; 

reducing dimensionality of the features via feature selection; 

categorizing the features via a classifier model to generate the similarity classifier; 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors as a function of the 
similarity classifier to determine the category similarity measurements. 

26. (Original) A computing device as recited in claim 25, wherein the similarity 
classifier is based on a statistical n-gram based naive Bayesian (N-Gram), a naive Bayesian 
(NB), support vector machine (SVM), a nearest neighbor (KNN), a decision tree, a 
co-training, or a boosting classification model. 

27. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions for formulating the expanded terms further 
comprise instructions for generating term clusters from term vectors based on calculated 
term similarity, the term vectors being generated from historical queries, each historical 
query having a high frequency of occurrence, the term clusters comprising the expanded 
terms. 

28. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions for generating the content similarity measurements 
further comprise instructions for generating respective term vectors from the bid term(s) and 
the site contents, and calculating similarity between the respective term vectors to determine 
direct similarity between the bid term(s) and the site contents. 

29. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions for generating the expanded similarity 
measurements further comprise instructions for: 

generating respective term vectors from the bid term(s), the site contents, and the 
expanded terms; and 

calculating similarity between the respective term vectors to determine the expanded 
similarity measurements between the bid term(s) and the site contents. 

30. (Cancelled). 

31. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions for calculating the confidence value further 
comprise instructions for: 

training a combined relevance classifier with data of the form <term(s), Web site 
content, accept/reject> in view of an accept/reject threshold; 

generating relevance verification similarity measurement (RSVM) feature vectors 
from the content, expanded, and category similarity measurements; and 

mapping multiple scores from the RSVM feature vectors to the confidence value via 
the combined relevance classifier. 

32. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions further comprise instructions for: 

determining proper name similarity measurements from the bid term(s) and site 
contents, the proper name similarity measurements indicating relatedness between any 
proper name(s) detected in the bid term(s) and the site contents in view of a set of proper 
names; and 

wherein the combined ones of multiple similarity measurements comprise the proper 
name similarity measurements. 

33. (Currently Amended) A computing device as recited in claim 32, wherein the 
computer-executable stored instructions for determining the proper name similarity 
measurements further comprise instructions for: 

responsive to detecting a proper name comprising at least one of in the bid term(s) or 
the site contents, calculating a proper name similarity score as: 

Prop_Sim(term, site contents) and 

wherein Prop_Sim(term, site contents) equals: one (1) when a term contains a proper 
name P, and site contents contains a conformable proper name Q; zero (0) when a term 
contains a proper name P, and site contents contains only unconformable proper name(s); or, 
zero-point-five (0.5). 

34. (Currently Amended) A computing device as recited in claim 25, wherein the 
computer-executable stored instructions further comprise instructions for: 

determining that the confidence value is relatively low; and 
responsive to the determining, identifying one or more other terms comprising at 
least one of semantically or contextually related to the bid URL. 

35. (Currently Amended) A computing device as recited in claim 34, wherein the 
computer-executable stored instructions for identifying further comprise instructions for: 

generating a set of term clusters from term vectors based on calculated term 
similarity, the term vectors being generated from search engine results of submitted 
historical queries, each historical query having a relatively low frequency of occurrence as 
compared to other query terms in a query log; and 

evaluating the site contents in view of term(s) specified by the term clusters to 
identify at least one or more semantically or contextually related terms, the terms being the 
one or more other terms. 

36. (Currently Amended) A computing device for verifying relevance between 
terms and Web site contents, the computing device comprising: 
retrieving means to obtain site contents from a bid URL; 

formulating means to identify expanded term(s) comprising at least one of 
semantically or contextually related to bid term(s), 

generating means to create content similarity and expanded similarity measurements 
from respective combinations of the bid term(s), the site contents, and the expanded terms, 
wherein the similarity measurements indicate relatedness between respective ones of the bid 
term(s), site contents, or expanded terms; 

calculating means to determine category similarity measurements between the 
expanded terms and the site contents in view of a similarity classifier, wherein the similarity 
classifier has been trained from mined web site content associated with directory data; 

calculating means to generate a confidence value from combined ones of multiple 
similarity measurements, wherein the combined ones comprise content, expanded, and 
category similarity measurements, wherein the confidence value provides an objective 
measure of relevance between the bid term(s) and the site contents; 

analyzing means to analyze the confidence value to identify the bid term(s); and 

increasing means to increase traffic to a site by using the bid term(s) identified; 

wherein the generating means further comprise: 

extracting means to obtain features from Web site content associated with the 
directory data, the features comprising a combination of at least one of title, metadata, body, 
hypertext link(s), visual feature(s), and summarization by page layout analysis information; 

reducing means to lessen dimensionality of the features via feature selection; 

categorizing means to organize the features via a classifier model to generate the 
similarity classifier; 

generating means to create respective term vectors from the bid term(s), the site 
contents, and the expanded terms; and 

calculating means to identify similarity between the respective term vectors as a 
function of the similarity classifier to determine the category similarity measurements. 

37. (Original) A computing device as recited in claim 36, wherein the computer 
formulating means further comprise generating means to create term clusters from term 
vectors based on calculated term similarity, the term vectors being generated from historical 
queries, each historical query having a high frequency of occurrence, the term clusters 
comprising the expanded terms. 

38. (Original) A computing device as recited in claim 36, wherein the generating 
means further comprise creating means to generate respective term vectors from the bid 
term(s) and the site contents, and calculating similarity between the respective term vectors 
to determine direct similarity between the bid term(s) and the site contents. 

39. (Original) A computing device as recited in claim 36, wherein the generating 
means further comprise: 

creating means to generate respective term vectors from the bid term(s), the site 
contents, and the expanded terms; and 

calculating means to determine similarity between the respective term vectors to 
determine the expanded similarity measurements between the bid term(s) and the site 
contents. 

40. (Cancelled). 

41. (Original) A computing device as recited in claim 36, wherein the calculating 
means further comprise: 

training means to train a combined relevance classifier with data of the form 
<term(s), Web site content, accept/reject> in view of an accept/reject threshold; 

generating means to generate relevance verification similarity measurement (RSVM) 
feature vectors from the content, expanded, and category similarity measurements; and 

mapping means to correlate multiple scores from the RSVM feature vectors to the 
confidence value via the combined relevance classifier. 

42. (Original) A computing device as recited in claim 36, wherein the computing 
device further comprises: 

determining means to determine proper name similarity measurements from the bid 
term(s) and site contents, the proper name similarity measurements indicating relatedness 
between any proper name(s) detected in the bid term(s) and the site contents in view of a set of 
proper names; and 

wherein the combined ones of multiple similarity measurements comprise the proper 
name similarity measurements. 

43. (Previously Presented) A computing device as recited in claim 42, wherein 
the determining means to determine the proper name similarity measurements further 
comprise responsive to detecting a proper name comprising at least one of the bid term(s) or 
the site contents, calculating means to calculate a proper name similarity score. 

44. (Previously Presented) A computing device as recited in claim 36, wherein 
the computing device further comprises: 

determining means to determine that the confidence value is relatively low; and 
responsive to the determining, identifying means to identify one or more other terms 
comprising at least one of semantically or contextually related to the bid URL. 

45. (Previously Presented) A computing device as recited in claim 44, wherein the 
identifying means further comprise: 

generating means to generate a set of term clusters from term vectors based on 
calculated term similarity, the term vectors being generated from search engine results of 
submitted historical queries, each historical query having a relatively low frequency of 
occurrence as compared to other query terms in a query log; and 

evaluating means to evaluate the site contents in view of term(s) specified by the 
term clusters to identify at least one or more semantically or contextually related terms, the 
terms being the one or more other terms. 